Early Nested Word Automata for XPath Query Answering on XML Streams

Authors

  • Denis Debarbieux
  • Olivier Gauwin
  • Joachim Niehren
  • Tom Sebastian
  • Mohamed Zergaoui
Abstract

Algorithms for answering XPath queries on XML streams have been studied intensively in the last decade. Nevertheless, there still exists no solution with high efficiency and large coverage. In this paper, we introduce early nested word automata in order to approximate earliest query answering algorithms for nested word automata in a highly efficient manner. We show that this approximation can be made tight in practice for automata obtained from XPath expressions. We have implemented an XPath streaming algorithm based on early nested word automata in the Fxp tool. Fxp outperforms most previous tools in efficiency, while covering more queries of the XPathMark benchmark.

Related resources

Streamable Fragments of Forward XPath

We present a query answering algorithm for a fragment of Forward XPath on XML streams that we obtain by compilation to deterministic nested word automata. Our algorithm is earliest and in polynomial time. This proves the finite streamability of the fragment of Forward XPath with child steps, outermost-descendant steps, label tests, negation, and conjunction (aka filters), under the reasonable a...

Earliest Query Answering for Deterministic Nested Word Automata

Earliest query answering (EQA) is an objective of streaming algorithms for XML query answering that aim for close to optimal memory management. In this paper, we show that EQA is infeasible even for a small fragment of XPath unless P=NP. We then present an EQA algorithm for queries and schemas defined by deterministic nested word automata (DNWAs) and distinguish a large class of DNWAs for whic...

Queries on Xml streams with bounded delay and concurrency

Query answering algorithms on XML streams check answer candidates on the fly in order to avoid unnecessary buffering whenever possible. The delay and concurrency of a query are two measures of the degree of its streamability. They count the maximal number of stream elements during the lifetime of some query answer and, respectively, the maximal number of simultaneously alive answer can...

Earliest Query Answering for Deterministic Streaming Tree Automata and a Fragment of XPath

We study the concept of earliest query answering as needed for streaming XML processing with optimal memory management. We derive lower complexity bounds showing that earliest query answering for Forward XPath is not feasible in polynomial time combined complexity except if P=NP. We then distinguish a fragment of Forward XPath with negation that enjoys P-time earliest query ...

Querying Streaming XML Using Visibly Pushdown Automata

We present a novel solution to streaming XPath query evaluation, capable of supporting queries with forward and reverse axes. We show a conversion of an XPath query to a Visibly Pushdown Automaton that preserves the semantics of the query. It was shown in [1] that such a construction results in an automaton suitable for efficient streaming query evaluation. We also show that the constructed automat...

Publication date: 2013